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Full subgraphs 


Victor Falgas-Ravry* Klas MarkstronV Jacques Verstraete^ 


Abstract 

Let G = (V E) be a graph of density p on n vertices. Following Erdos, Luczak and 
Spencer, an m-vertex subgraph iT of G is called full if H has minimum degree at least 
p{m—l). Let /(G) denote the order of a largest full subgraph of G. ^^( 2 ) is a non-negative 
integer, define 

/(n,p) = min{/(G) : |U(G)| = n, |£;(G)| = pQ)}. 

Erdos, Luczak and Spencer proved that for n > 2, 

( 2 n )2 — 2 < f{n, p < 4n^(logn)V 

In this paper, we prove the following lower bound: for n~^ < < 1 ~ 

f{n,p) > 1(1 - 1. 

Furthermore we show that this is tight up to a multiplicative constant factor for infinitely 
many p near the elements of {p In contrast, we show that for any n-vertex 

graph G, either G or G° contains a full subgraph on fl(p^) vertices. Finally, we discuss 
full subgraphs of random and pseudo-random graphs, and several open problems. 


1 Introduction 

1.1 Full subgraphs 

A full subgraph of a graph G of density p is an m-vertex subgraph H of minimum degree at 
least p{m — l). This notion was introduced by Erdos, Luczak and Spencer Q]. We may think of 
a full subgraph as a particularly ‘rich’ subgraph, with ‘unusually high’ minimum degree: if we 
select m vertices of G uniformly at random, then the expected average degree of the subgraph 
they induce is exactly p{m — 1), which is the minimum degree we require for the subgraph to 
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be full. One cannot in general expect to find m-vertex subgraphs of higher minimum degree, 
as may be seen for example by considering complete multipartite graphs with parts of equal 
sizes. Fixing p and n, one may ask for the largest m such that every n-vertex graph G has a 
full subgraph with m vertices. For a graph G, let f{G) denote the largest number of vertices 
in a full subgraph of G. If p{^ is a non-negative integer, define 

/(n,p) =min{/(G) : \V{G)\ = n,\E{G)\ =pQ)}. 

Erdos, Luczak and Spencer raised a problem which, in our terminology, amounts to deter¬ 
mining f{n,p) when p = ^, and showed ( 2 n )2 — 2 < /(n, < {2 + -^)n^(\ogn)^. In this 

paper, we prove the following theorem for general p, improving the lower bound of [^: 

2 1 

Theorem 1. For all p = Pn such that n“3 < p^ < 1 — n~T, 

f{n,p) > i(l -p)3n^ - 1. 

Moreover for each c> 1, if p = + cn~^ for some r G N, then f{n,p) = 0(n3). 

We also show in Section [3] (after the proof of Theorem[2]) that if p < n~i then \f{n,p)—p^n\ < 
1. A case of particular interest is p = where Theorem [1] together with the results of Erdos, 
Luczak and Spencer gives 

- 1 < /(n, i) < (^2 + (log n)i 

A similar construction to that of Erdos, Luczak and Spencer shows that /(u,p) is within a 
logarithmic factor of when p G The order of magnitude of /(n,p) is not 

known in general, and we pose the following problem: 

Problem 1. For each fixed p G (0,1), determine the order of magnitude of f{n,p). 

It may also be interesting to determine the order of magnitude of the minimum possible value 
of /(G) when G is from a certain class of n-vertex graph of density p, such as AT^-free graphs 
or graphs of chromatic number at most r. 

1.2 Discrepancy and full subgraphs 

Eull subgraphs are related to subgraphs of large positive discrepancy. For a graph G of density 
p and an m-vertex set X C E(G), let 5(A1) = e(X) —p(^), where e{X) denotes the number of 
edges of G that lie in X. The positive and negative discrepancy of G are respectively defined 
by 

disc“'“(G) = max 5(A") disc“(G) = max (—5(A)). 
xcy(G) xcy(G) 

The discrepancy of G is disc(G) = max{disc“'“(G), disc“(G)}. We prove the following simple 
bound via a greedy algorithm in Section [3l relating the positive discrepancy to the order of a 
largest full subgraph: 
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Theorem 2. Let G be a graph of density p with disc'^{G) = a > 0. Then 


f{G) >{l-p) 2(2a)i 


Theorem [2] is best possible, since the graph G consisting of a clique with edges and n — m 
isolated vertices has f{G) = m and 


di.c+(G) = (”) (l - §) . 


= m. On the other hand, if G 


whereas the bound given by Theorem[2]is /(G) > 

4 

is any n-vertex graph obtained by adding or removing o(n3) edges in a complete multipartite 
graph with a bounded number of parts of equal size, then disc"''(G) = o(n3) and the lower 
bound in Theorem [2] is superseded by Theorem [TJ 


That there should be a relation between full subgraphs (which have unexpectedly high mini¬ 
mum degree) and subgraphs with large positive discrepancy (which have unexpectedly many 
edges) is not surprising. Indeed, an easy observation is that any subgraph maximising the 
positive discrepancy must be a full subgraph (see Lemma [9]). 


1.3 Random and pseudo-random graphs 


In a random or pseudo-random setting, we are able to improve our bounds on the size of a 


largest full subgraph by drawing on previous work on discre 


jancy and jumbledness. Jum- 


bledness was introduced in a seminal paper of Thomason [2^ as a measure of the ‘pseudo¬ 
randomness’ of a graph. 


Definition. A graph G is {p, j)-jumbled if for every X C V{G), |(5p(X)| < j\X\ 


We prove that graphs which are ‘well-jumbled’ — meaning that they are (p, j)-jumbled for 
some small j, and so look ‘random-like’ — have large full subgraphs. 


Theorem 3. Suppose G is a {p, j)-jumbled graph of density p. Then 


/(G) > 


disc“''(G) 

3 


This result, which for small values of j improves on Theorems [T] and [21 is proved in Section SI 

For random graphs, Erdos, Luczak and Spencer showed that for p = f{Gn,p) > /3in—o(n) 
asymptotically almost surely, where /3i ~ 0.227. Using Theorem |3l we can extend this linear 
lower bound to arbitrary, fixed p £ (0,1). Erdos and Spencer Q] proved that for p = ^ we 
have asymptotically almost surely 

disc+(Gn,p) = 0(p^(l-p)^n^), (1) 
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and that the same bound holds for disc“(G„^p). By extending their arguments, it is eas¬ 
ily shown that ([1]) holds for arbitrary p S (0,1). Further it is well-known that G = Gn,p 
asymptotically almost surely has |(5p(X)| = — p)n\X\) for all X C V{Gn,p) (see for 

example [1^), so that Gn,p is (p, j)-jumbled for some j = 0{y/p{l — p)n). Combining this 


with (JH) and Theorem [3l we obtain that for fixed p £ (0,1) asymptotically almost surely, 


f{Gn,p) = n(n). 


( 2 ) 


In the other direction, results of Riordan and Selby 1^ imply that for all fixed p £ (0,1), 
f{Gn,p) < f^ 2 n + o{n) asymptotically almost surely, where (32 ~ 0.851.... We believe that 
f{Gn,p) is concentrated around f3n + o(n) for some function (3 = /3p, and pose the following 
problem. 


Problem 2. For each fixed p £ (0,1), prove the existence and determine the value of a real 
number (3 = (dp such that for all <5 > 0, F{\f{Gn,p) — (3pn\ > 6n) —>-0 as n ^ oo. 


1.4 Full and co-full subgraphs 


We also a consider a variant of our problem with a Ramsey-theoretic flavour. A subgraph 
H of a graph G is co-full if V{H) induces a full subgraph of G^, the complement of G. 
Equivalently, an induced m-vertex subgraph H of a graph G with density p is co-full if it has 
maximum degree at most p(m — 1). Let g{G) be the largest integer m such that G has a 
full subgraph with at least m vertices or a co-full subgraph with at least m vertices. In other 
words, g{G) = max{/(G),/(G^)}. Setting g{n) = min{g(G) ; |F(G)| = re}, we prove the 
following theorem: 


Theorem 4. There exist constants ci,C 2 > 0 such that 


Cl 


re- 

log re 


< g{n) < C 2 


re log log re 
log re 


Bounding g{G) is related tm but distinct from, a problem of Erdos and Pach [9(] on quasi- 
Ramsey numbers (see also [ij]). Erdos and Pach Q] showed that for every re-vertex graph G, 
in either G or G‘^ there exists a subgraph with rre = ll(i^^) vertices and minimum degree at 
least ^(rre — 1). In particular, when G has density this shows g{G) = Erdos and 

Pach in addition gave an unusual weighted random graph construction G' to show their quasi- 
Ramsey bound was sharp up to a log log re factor. While G' does not have density a simple 
modification (see Section [5?^ gives a graph G* of density ^ such that g{G*) = 
and this gives the upper bound in Theorem [H This leaves the following problem open: 

Problem 3. Determine the order of magnitude of g{n). 


By ([2]) with p = note that g{G) is linear in re for almost all re-vertex graphs G. We may also 
define g{n,p) = min{( 7 (G) : |F(G)| = re, |F(G)| = ^( 2 )}! ask for the order of magnitude 
of g{n,p). Note Theorem H] gives g{n,p) = fl(i^;^) for all p. 
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1.5 Relatively half-full subgraphs 

If G is a graph, then a relatively half-full subgraph of G is a subgraph R of G such that 
dniv) > ^dciv) for every v € V{H). A key ingredient in the proof of Theorem [1] is the 
following theorem on relatively half-full subgraphs: 

Theorem 5. Let G he an n-vertex graph. Then G contains a relatively half-full subgraph with 
LfJ or LfJ + 1 vertices. 

Theorem [5] is best possible, in the sense that the smallest non-empty relatively half-full sub¬ 
graph of Kn has -|- 1 vertices and the smallest relatively half-full subgraph of -An,n has 
n -|- 1 vertices when n is odd. For regular graphs, we obtain: 

Corollary 6 . Let G be an n-vertex d-regular graph. Then G contains a full subgraph with 
Lf J or -|- 1 vertices. 

(Note that of course G itself is full, as it is regular.) When d is very small relative to n, Alon [l| 
showed that any d-regular n-vertex graph contains a subgraph on vertices in which the 
minimum degree is at least -|- cd^, exceeding the requirement for a full subgraph by an 
additive factor of cd^. However, as observed by Alon [l|, such a result does not hold for large 
d, as for example complete graphs and complete bipartite graphs show. 

1.6 Relatively g-full subgraphs 

Let q G [0,1]. A subgraph R of a graph G is relatively q-full if dniv) > qdciv) for all 
V G V{H). We prove: 

Theorem 7. Let G be a graph on n vertices. Then for every 5 G [0,1], G contains one of the 
following: 

(i) a relatively q-full subgraph on \qn~\ vertices, or 

(ii) a relatively (1 — q)-full subgraph on [(1 — q)n\ vertices, or 

(hi) a relatively q-full subgraph on [gn] -|- 1 vertices and a relatively (1 — q)-full subgraph on 
[(1 — ( 7 )nJ -|- 1 vertices. 

Using Theorem [TJ we prove Theorem [5] and an extension to relatively y-full subgraphs for 
r > 3: 

Theorem 8 . Let G be a graph on n vertices, and let r G N. Then G contains a relatively 
I-full subgraph on [j\, \j] or \j] -\-l vertices. 

Theorem [ 8 ] is best possible in the following sense: if r > 3, consider the complete graph Kn 
for some n > r -|- 2 with n = 2 mod r. A smallest non-empty relatively y-full subgraph of Kn 
has exactly -|- 1 = j"^] -|- 1 vertices. 

It is natural to ask whether Theorem [ 8 ] can be extended further to cover other q. 


5 


Problem 4. Determine whether there exists a constant c such that for every q G [0, ^], every 
n-vertex graph G has a relatively q-full subgraph with at least [qn\ vertices and at most [qn\ +c 
vertices. 

For g > ^, a cycle of length n shows that there exist n-vertex graphs with no non-empty 
relatively g-full subgraphs on fewer than n vertices. We might try to circumvent this example 
by requiring a weaker degree condition: define a subgraph H oi a graph G to be weakly 
relatively q-full if dniy) > YqdG{v)\ for all v G V{H). However even for this notion of q- 
fullness a natural generalisation of Theorem [8] fails for rational g > ^: consider the second 
power of a cycle of length n. If x is a vertex in a weakly relatively |-full subgraph H, then all 
but at most one of its neighbours must also belong to H. Thus vertices not in H must lie at 
distance at least 5 apart in the original cycle, and H must contain at least |n vertices, rather 
than the |n -|- 0(1) we might have hoped for. It would be interesting to determine whether 
powers of paths or cycles provide us with the worst-case scenario for finding weakly relatively 
g-full subgraphs when q> \. 

Problem 5. Let g G (|, !)• Determine whether there exist a constant Cq < 1 such that every 
n-vertex graph G has a weakly relatively q-full m-vertex subgraph where [gnj < m < CqU. 

1.7 Notation 

We use standard graph theoretic notation. In particular, if X, Y are sets of vertices of a graph 
G = (H, E), then e{X) denotes the number of edges in the subgraph G[X] of G induced by X, 
e{G) is the number of edges in G, and e(X, Y) is the number of edges with one end in X and 
the other end in Y. Denote by dx{x) the number of neighbours in X of a vertex x G V{G). It 
is convenient to let 6p{X) = e{X) — p(}^^) when X C V{G). The Erdos-Renyi random graph 
with edge-probability p on n vertices is denoted by Gn,p. If (^n)neN is a sequence of events, 
then we say occurs asymptotically almost surely if lim^^oo = 1- 

2 Relatively g-full subgraphs : proofs of Theorems [5] — [8] 

Proof of Theorem\^ Let G be a graph on n vertices, and let g G [0,1] be fixed. Let X UY 

be a bipartition of V{G) with |X| = [gn] and |T| = [(1 — g)nj maximising the value of 
{I - q)e{X) + qe{Y) ;= M. 

If X is relatively g-full or Y is relatively (1 — g)-full, then we are done. Otherwise there 
exist x £ X and y £ Y with dx{x) < \qdG{x)~\ — 1 and dyiy) < [(1 — Q)dG{y)~\ — 1- Let 
X' = (X \ {x}) U {y} and W = (T \ {y}) U {x}, and let Ixy = 1 if {x, y} £ E{G) and \xy = 0 
otherwise. Write dG{x) = d{x) for x G V{G) and set M' = (1 — g)e(X') -|- qeiY'). Then 

M' = M + {I - q)dx{y) - qdviy) + qdyix) - {I - q)dx{x)-Ixy 
= M + {l- q)d{y) - dyiv) + qd{x) - dx{x) - Ixy 
> M -t (^(1 - q)d{y) - ((1 - q)d{y)']J + (qd{x) - Iqdix)]^ -£2- Ixy. 
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Since X UY maximised (1 — q)e{X) + qe{Y) over all bipartitions with |X| = \qn\, |y| = 
[(1 — q)n\, M' < M and we deduce from the inequality above that 

(a) dx{x) = \qd{xy\ - 1 and dyiv) = [(1 - g)d(y)l - 1- 

(b) {x,y}^E{G). 

(c) (1 -q)d{y) < [(1 -q)d{yy\ and qd{x) < \qd{xy\. 

Now let Bx denote the set of x € X with dx{x) < \qd{xy\ — 1 and By the set of y € X 
with dy{y) < [(1 — q)d{y)'\ — 1. By our assumption, both sets are non-empty. By (b) above, 
Bx U By induces a complete bipartite subgraph of G. Thus by (a) we have that for every 
x € Bx,y S By, X U {y} is a relatively (;-full subgraph on \qn] + 1 vertices and Y U {x} is a 
relatively (1 — g)-full subgraph on [(1 — q)n\ + 1 vertices. □ 

Proof of Theorem O Apply Theorem [3 with q = □ 


Proof of Corollary Suppose at least one of n,d is odd. By Theorem [5l every d-regular 
graph has an m-vertex subgraph H with m G + 1} such that dniv) > Tf] for 

every v G V{H). Since for n,d not both even 


-d- 

2 


r d 

n ~ 

n — 1 

121 


> 


- d 
n — 1 


(m 


1 ) 


the subgraph P[ is a full subgraph. 

In the case where both n and d are even, we need to use a slightly stronger form of Theorem [5l 
In the particular case where G is d-regular with d even and q = ^, condition (c) in the proof 
of Theorem [3 cannot be satisfied, and in particular one of the alternatives (i) or (ii) must hold 
in Theorem [3 Thus G must contain a subgraph H on ^ vertices with minimum degree at 
least |, which is a full subgraph. □ 


Proof of Theorem\^ We use Theorem [3 and induction on r. The base case r = 1 is trivial, 
and Theorem [5] deals with the case r = 2. Now apply Theorem [3 with q = \'. given a graph 
G on n vertices, this gives us a ^-full subgraph on or [^] + 1 vertices (alternatives (i) 
and (hi)) or an ^^-full subgraph H on vertices (alternative (h)). In the latter case, 

we use our inductive hypothesis to find a ^^^-full subgraph H' ol Pd on m vertices, for some 
Txi '■ [yj < m < [y] -|- I. The subgraph H' is easily seen to be a y-full subgraph of G, and so 
we are done. □ 


3 A greedy algorithm : proof of Theorem [2] 

A natural strategy for obtaining a full subgraph in a graph G of density p on n vertices is 
to repeatedly remove vertices of relatively low degree. When there are i vertices left in the 
graph, such a greedy algorithm finds a vertex of degree at most \p{i — 1 )] — 1 and deletes that 
vertex, unless no such vertex exists, in which case the i vertices induce a full subgraph. If G 
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has positive discrepancy a, then we apply this algorithm in a subgraph H on m vertices with 
e{H) > p(^) + a to obtain Theorem [2l 

Proof of Theoreml^ If G has positive discrepancy a > 0, then its density p is strictly less 
than 1. Let iL be a subgraph of G with m vertices such that e{H) = p(^) + a. At stage i 
we delete a vertex of degree at most \p{m — i)] — 1 in the remaining graph, or stop if no such 
vertex exists. The number of edges remaining after stage i is at least 

/ / m\ f m\ fm — i\ fm — i\ 

Therefore the greedy algorithm must terminate with a full subgraph onm — i vertices for some 
i satisfying (1 — p)(™ 2 ~*) — conclude /(G) > m — i > (1 — p)~^ {2a)^. □ 

An alternate proof may be obtained by appealing to Lemma [9l which states that a subgraph 
attaining the maximum positive discrepancy must be full. The example of a clique with m 
vertices and n — m isolated vertices which shows that Theorem [2] is tight is the same example 

which shows f{n,p) = 0{p^n) for p < n~^. We now prove that \f{n,p) — pin\ < 1 for 

. _2 
p <n 3. 

Proof that \ f{n,p) — p^n\ < 1 for p < n“3. First we show f{n,p) < pin + 1 for all p : 0 < 

p < 1. If m is defined by < p( 2 ) ^ (™)) then the n-vertex graph G consisting of a 

subgraph of a clique of size m with edges, together with n — m isolated vertices has 

/(G) < m < pin + 1. Next we show that every n-vertex graph G of density p has a full 

1 2 

subgraph with at least p^n — 1 vertices if p < n“3. Remove all isolated vertices from G. The 
number of isolated vertices is clearly at most n — pin, otherwise the remaining graph has p(^'j 
edges and fewer than pin vertices, which is impossible since this is denser than a complete 
graph. So we have a subgraph H with at least pin vertices and p{^ edges with no isolated 
vertices. Clearly H has a subgraph of minimum degree at least 1 with at least pin—1 vertices 
and at most pin + 1 vertices, since the removal of a leaf in a spanning forest creates at most 
one new isolated vertex. This subgraph is full since 

|'p(p3n -1-1 — 1)] = [p^n] < 1 

when p < n“3, as required. □ 

Remarks. The analysis of the greedy algorithm in the proof of Theorem [2] above is not 
optimal; in fact by considering the asymptotic behavior of 

(j) = lim inf — 

n—^cxD 77 , 

2 = 1 


^(p(n -i) + l- \p{n - i)]). 
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and performing our greedy algorithm directly on G rather than on a maximum discrepancy 
subgraph it follows that 


f{n,p) > < 


J 


1-p 


if p is rational with denominator q > 1 
if p < 1 — e for some e > 0 and p> 


For instance if say ^ —o(l) < then /p„(re) > — as shown by Erdos, Luczak 

and Spencer [^. These lower bounds on f{n,p) will be superseded by the better bounds given 
in Theorem [TJ 


We note that there exist examples of n-vertex graphs with density p = | + o(l) where greedily 
removing a vertex of minimal degree could yield a full subgraph of order only 0{^/n). Consider 
the graph G on V = {0,1, ....,4n + 1} obtained by taking the power of the Hamiltonian 
cycle through 0,1, 2,..., 4n + 1, adding edges between all antipodal pairs {i,i + (2n + 1)} 
(with addition modulo 4n + 2), and adding a complete bipartite graph Km,m with parts 
{0,1,m—1} and {2n+l, 2n+2,2ra+m}, where m = (3n)^+0(l). It is an easy exercise to 
show that by removing antipodal pairs of minimum degree vertices a greedy algorithm could 
fail to find a full subgraph until it has stripped the graph down to the planted complete 
bipartite graph Km,m- 


4 Jumbledness: proof of Theorem [3] 


As our arguments involve passing to subgraphs with different edge densities, it shall be useful 
to adapt our notion of full subgraphs, discrepancy and jumbledness as follows. 

Let G = (F, E) be a graph. An induced subgraph iL of G on m vertices is called p-full if its 
minimum degree is at least p{m — 1), and is called p-co-full if its maximum degree is at most 
p{m — 1). Let fp{G) be the largest number of vertices in a p-full subgraph of G, and let gp{G) 
be the largest size of a p-full or p-co-full subgraph of G. If p happens to be the density of 
G, then in fact fp{G) = f{G) and gp{G) = g{G). Determining the smallest possible values 
of fp{G) and gp{G) given the number of edges and number of vertices in G can be viewed as 
generalisations of the Turan and of the Ramsey problems, respectively, which comprise the 
case p = 1; see L^ and the references therein. 

The positive p-discrepancy of G is defined to be disCp(G) = maxxcv 5p(-T). The negative 
p-discrepancy of G is disc“(G) = max^cy(— (5p(Ai)). The p-discrepancy of G is disCp(G) = 
max (disCp (G),disc“(G)). Finally, the p-jumbledness of G is 


jp{G) = max 




xcy \X\ 

We begin with the following simple observation: 

Lemma 9. Let X he a subset of V{G) such that Sp{X) = disCp (G). Then G[X] is p-full. 
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Proof. Indeed, otherwise deleting a minimum degree vertex from X would strictly increase 
the p-discrepancy. □ 


We shall prove the following, slightly more general form of Theorem [3l 


Theorem 10. Let G he a graph and let p € [0,1]. Then 


fp{G) > 


disCp (G) 

jp(G') 


and 


gp{G) > 


disCp(G) 


Proof. Let X be a subset of V (G) such that 6p{X) = disc^ (G). Then G[X] is p-full by Lemma 
El and in particular, fp{G) > |X|. By dehnition of p-jumbledness we have 


disCp (G) 


Sp{X) <jp{G)\X\<jp{G)fp{G). 


Applying the resulting lower bound for fp{G) to the complement G^ of G (and noting that 
disc“(G) = disc]Lp(G‘^) and jp{G) = ji-p(G'^)), we have 


9p{G) > max 


/ disCp (G) 
jpiG) 


disc+_p(G-)\ 

h-piG'^) ) 


disCp(G) 


□ 


5 Proof of Theorem |4] 


Let G = {V, E) be a graph on n vertices, and let fe : 1 < < n be an integer. 

p-jumbledness of G on k-sets is defined to be 


jk,p{G) = max 




XCV: |X|=fc |X| 

Similarly, the positive p-discrepancy of G on k-sets is defined to be 


disct (G) = max 6p{X), 
JCCV: \x\=k ^ 


The 


with the negative p-discrepancy on k-sets disc^p(G) and the p-discrepancy on k-sets discfc_p(G) 
defined mutatis mutandis. Note that by definition we have jp{G) = max{jfc^p(G) : 1 < k < n} 
and disCp(G) = max{discfc^p(G) : 1 < A; < n}. In addition for each k: 1 < k < n we have 
discfc^p(G) — k ■ jk,p{,Gf 


In general, it is not true that the jumbledness on fc-sets is of the same order as the global 
jumbledness of G. Indeed consider an Erdds-Renyi random graph G with edge probability 
p = 1 within which we plant a clique and a disjoint independent set, each of order m = ni. It 
is straightforward to show with k = \^~\ that asymptotically almost surely i(G) = 0(n^) 
whilst ji (G) = 0(m). To prove Theorem [H we use the following theorem of Thomason 221 
to show that every n-vertex graph of density p has a subgraph G* with n* = Ll{n) vertices 
such that the p-jumbledness of G* on [^]-sets and the global p-jumbledness of G* differ by 
a factor of only O(logn). 
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Proposition 11 (Thomason 2^). Let G be a graph of order n, let pn be an integer between 2 
and n — 2, and let M > 1 and p € [0,1]. Suppose that every set X of rjn vertices of G satisfies 


|5p(^)| < vna. 

Then G contains an induced subgraph G' of order 

such that G' is {p, Ma)-jumbled. 


5.1 Proof of g{n) = 

Let G = Go be a graph on n = no vertices with density p. Let g = \ and M = 880r7(l — 
gYfiogn + 1). For i > 0, if 

jr^^,p{Gi)<[^,jp{Gi)l (3) 

then apply Proposition [11] to find an induced subgraph Gj+i of G* on Uj+i > (1 — 
vertices with 


jp(Gj+i) < p(Gj). 


( 4 ) 


Combining (|3|) and (|H) and iterating, we see that 


jp{Gi+i) < < [gjp{Gi)\ < LV^^jp(Go)J < [e * ^(n/2)J. (5) 

Since jp(Gj+i) > 0, we deduce from ([5|) that this procedure must terminate for some i < logn 
with a graph G* = G* on Uj vertices where 


m 


> 1 - 


880 \ 
77(1 —j 


log n 


n = 1 — 


logn+1 j 


1 


log n 


n>f 


and where j|-i^]^p(G*) > ^j{G*). Applying Theorem fTOl to G* we have 


g{G) > gp{G*) > 


disCp(G*) 

3p{G*) 


> 


disC|-2Hi^p(Gj) ^ Ui ^ n 
eMj^p^-^^p{Gi) + 1 “ 4eM “ Ae^M 


Since M = 880r/(l — r 7 )^(logn + 1) = 0(logn), this completes the proof. 


□ 


5.2 Proof of g{n) = O ■ 

We shall base our argument on a result of Erdos and Pach [^. Using an unusual weighted 
random graph construction, they proved the following theorem: 
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Proposition 12 (Erdos-Pach (9|). For any v > 0 there exists c > 0 such that for all n 
sufficiently large there exists an n-vertex graph G in which every m-set of vertices with m > 
cn log log n/log n induces a subgraph with minimum degree strictly less than — 1 ) — 
and maximum degree strictly greater than ^{m — 1 ) + 


We note that the formulation of Erdos and Pach’s result given above is due to Kang, Pach, 
Patel and Regts, see Theorem 4 in 1^ and Section 3 in the same paper for an exposition 
of the delicate calculations involved. Erdos and Pach’s construction almost gives us what we 
want, namely a graph with no large full or co-full subgraph, with one caveat: it does not have 
density We circumvent this problem by taking disjoint copies of their construction and its 
complement and adding a carefully chosen random bipartite subgraph with density ^ between 
them. 


Eix V = and let 2 n € N be sufficiently large to ensure the existence of a graph G 2 n with 
the properties guaranteed by Proposition [T 2 j Let A and B be disjoint sets of 2n vertices, each 
split into n pairs. Given a pair in A and a pair in B, place one of the two possible matchings 
between them selected uniformly at random, and do this independently for each of the 
pairs (pair from A, pair from B). This gives a random bipartite graph H between A and B 
with density precisely Add a copy of G 2 n to A and a copy of its complement G^n to B to 
obtain a graph G* on 4n vertices with density exactly 

Let m = [cn log log n/log re], and let 2A = = mv. Let K be a set of at least 4m vertices 

in G*. Without loss of generality, assume that I = \ Y D B\ < \Y D A\ and thus |K n A| > 2m. 
Set X' to be the collection of vertices in K n A that have degree at most ^(|ynA| — 1) — A 
in K n A. There are at least A such vertices, for otherwise X = (K n A) \ X' is a set of 
at least 2m — A > m vertices inducing a subgraph of G 2 n with minimum degree at least 
\ (|K n A| — 1) — 2A > ^ (|X| — 1) — |X|7, a contradiction. Eor each pair from A discard if 
necessary one of its two vertices from X' to obtain a set X” C X' of at least A/2 vertices, each 
coming from a distinct pair. Note that by construction this means the degrees of the vertices 
from X" into Y r\ B are independent random variables with mean 

Eor Y to induce a full subgraph of G, each vertex in X' would need to have at least + A 
neighbours in Y n R. By standard concentration inequalities (e.g. the Chernoff bound), 
the probability that a given vertex in X" has that many neighbours in K n B is at most 
exp (—A^/2Z). By the independence noted above, the probability that all vertices in X" have 
the right degree in K n B is thus at most 

exp (^-^\^"\^ ^ exp > exp |^-ire“(logre)"^^ = o( 2 "‘^”). 

It follows that asymptotically almost surely there is no pair {X',Z) where X' C A is a 
collection of at least A vertices and Z C B are such that every vertex in X' has degree at 
least \\Z\ + A in Z. By symmetry, asymptotically almost surely no such pair exists either 
when X' Q B and Z C A, and in particular G* contains no full subgraph on 4m vertices. Still 
by symmetry, asymptotically almost surely the complement (G*)'^ also fails to contain a full 
subgraph on 4m vertices, and we deduce that g{G*) < 4m as desired. 
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The construction of G* shows g{^n) = O (4nloglog(4n)/log(4n)), and it is straightforward to 
adapt it to show that more generally gin) = O (n log log n/log n) when n ^ 0 mod 4. □ 


6 Proof of Theorem [T] 


Proof of fin,p) = 0(ni) for p = + cn~i and c > 1 fixed. Let n G N, and let p = + 

cn~i for some c > 1 be such that G N. Set 5 = cn~i. Take a complete (r + l)-partite 

graph with parts Si, S 2 , ■ ■ ■, Sr+i, and n vertices in each part, and for i = 1 , 2 ,..., r + 1 , add 
a clique Ti of size k m Si, such that 




2 + + 



A quick calculation shows k > yf^n for n sufficiently large. Delete edges from the Tj in 
an equitable manner as necessary to obtain a graph Gn on (r + l)n vertices with precisely 
edges. Supp ose H is a. full subgraph of Gn on m> [r + \)k vertices, induced by sets 
Xi C Si where Xi has size Si for i = 1,2,... ,r + l. Without loss of generality, we may assume 
Si = maxj Si > k. For a vertex v G Xi, let djiv) denote the number of neighbours of v in Xj. 
Then for x G Xi \ ViTi), which is non-empty since si > k. 


r+1 ^ 

dnix) = '^^djix) < m — Si < - -m. 

j=2 ^ 

On the other hand, since H is full, 

T 

dnix) > p{m — 1 ) =- m + 6m — p. 

r + 1 

It follows that 6m < p and thus m < 6~^p < c~^n^. However we had assumed that m > 
(r + l)k > r^y/cni. Taken together, our bounds for m imply < r~i , and in particular 
c < 1, a contradiction. Thus 

fiGn) <ir + l)k = 0 (^((r + l)n)S^ . 

This proves the second part of Theorem [TJ □ 


Proof of f{n,p) > \il — p)ini — 1 for p = pn ■ n~i < Pn < 1 — n~7. Let G be an n-vertex 
graph of density p. We shall repeatedly delete vertices of minimum degree to obtain a sequence 
of subgraphs G = Gi, G 2 , G 3 ,..., with Gi having n — i + 1 vertices. 

Let m = \^~\ and d* = [p(n — i)]. Note that di is the minimum degree required for Gj to be 
full. Let t be a positive integer so that (1 —p)~^n^ < 2* < 2(1 — p)“§nT and let be the 
remainder when di is divided by 2*. For at least of the values i : 1 < i < m, we have 

Uj < (1 — p) 2 *. At stage i < m of the algorithm, we delete a vertex of minimum degree from 
Gj. If for some i < m such that r, < (1 — p) 2 *, all n — i -|- 1 vertices in the graph G* have 
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degree at least dj — + 1, then, by Theorem [5] (or Theorem [S] applied t times), Gi has a ^-full 

subgraph H on N vertices, where 


n — i + 1 
2 * 


< N < 


n — i + 1 
2 * 


Write di = q2^ + . The minimum degree in H is 

di-ri + 1 - 


D > 


2 * 


n — i 1 


= q + l. 


( 6 ) 


For Ff to be a full subgraph of G we require D > p{N — 1). Now 


di Vi 

<^+p = q + ^+p, ( 7 ) 

which is at most q + 1 since r* < (1 — p) 2 h As this is at most our lower bound on IF, H is a 
full subgraph of G. Our choice of t ensures 


\vm > 


m 


> 


(1 - p)§n§ 


- 1 . 


On the other hand suppose that at every stage i < m of the greedy algorithm where r, < 
(1 — p) 2 *, we could remove a vertex of degree at most \p{n — i)] — r* and that at every other 
stage i < mwe could remove a vertex of degree at most \p{n — i)] — 1 (for otherwise we would 
have found a full subgraph on at least m vertices). Set I = {i < m : < (1 — p)2*}. We 

know that |I| > d-p)'" _ What is more, I can be divided into intervals of consecutive indices 
i of length at most (1 — p) 2 * • (|), and over each of these intervals r* takes each of the values 
1,2,... [(1 — p)2*J at least times. Indeed, suppose Vi-i = j + 1 and r* = j for some j > 1. 
Then there is a k: < k < ^ such that Pj/ = j for G + .. .i + k — 1} and Pj+fc = j — 1. 

By considering ^ Pj on these intervals and using m = [^], (1 —p)~^n3 < 2* < 2(1 — p)~in3, 
we get that: 



Then, by Theorem [2l we have 


/(G) > 



> 


(1 

4 


2 


This completes the proof of Theorem [TJ 


□ 
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Remark. We did not optimise the constants in the proof of f{n,p) = ri((l — since it 

is unlikely that this argument gives an asymptotically tight lower bound on f(n,p). Also note 
that for p = 1 — o{n~7), this lower bound on f{n,p) is superseded by that given in Theorem[2j 


7 Concluding remarks 


• We showed that f{n,p) = 0(n§) and that this bound is tight up to constants for many 
values of p. It remains an open problem to determine the order of magnitude of f{n,p) for 
each p = p{n). Similarly, we leave it as an open problem to determine the order of magnitude 
of g{n) and g{n,p), having proved g{n) = fI(n/logn) and g{n) = 0(n log log n/log n). 


• One motivation for studying relatively half-full subgraphs, apart from their use in the proof of 
Theorem [H is a random process on graphs known as majority bootstrap percolation: vertices 
of a graph are infected at time zero with probability p, and at any later time a vertex becomes 
infected if more than half of its neighbours are infected. A key quantity of interest in bootstrap 
percolation is the function 6p{G), which is the probability that the process on the graph G 
infects all the vertices in finite time. The quantity 9p{G) is precisely the probability that 
at time zero there is no relatively half-full subgraph of uninfected vertices. One may ask 
whether there exist p > 0 and c < 1 such that for every graph G, 9p{G) < c. In other words, 
is it the case that if the infection probability is too small (but still positive), then there is 
an absolute positive probability that on any graph G we fail to infect any vertex from some 
relatively half-full subgraph (and hence that the infection does not spread to all vertices of 
G)1 Recently, this was answered in the negative by Mitsche, Perez-Gimenez and Pralat 16l |. 
who given any arbitrarily small p > 0 constructed a sequence of regular graphs Gi,G 2 , - ■ ■ 
such that liningao 9p{Gn) = 1. Another question, which remains open, is whether the number 
of relatively half-full subgraphs of an n-vertex graph grows at rate exp(0(n)) as n ^ oo. 


• Hypergraphs. If G is an n-vertex r-uniform hypergraph of density p, then an m-vertex 
subgraph R of G is full if for every v S V{H), dniv) > p{'^Zi)- Write /(G) for the order 
of a largest full subgraph of G and f^{n,p) for the minimum of /(G) over all r-uniform 
hypergraphs G on n vertices with density p. The problems studied in this paper can be 
generalised to uniform hypergraphs, and in particular, one may ask for the order of /^(n,p). 
It is straightforward to imitate the proof of the upper bound in Theorem [1] to show that for 
certain values of p G (0,1) that f^{n,p) = O(n^). We leave open the problem of determining 
the tightness of this upper bound, as well as of giving bounds on /'’(n,p) when p decays as n 
grows: 


Problem 6. Determine the order of magnitude of f^{n,p) for p = p{n) and r >3. 


The greedy algorithm from Section [3] generalises to r-uniform hypergraphs, in which it yields a 
full subgraph of order only . It is an open question as to how the results on relatively g-full 
subgraphs in Section 11.61 may be extended to r-uniform hypergraphs. On the other hand. 
Proposition [TT] was generalised to hypergraphs by Haviland and Thomason , and using 
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their result, the proof of the lower bound in Theorem 0] extends to the setting of r-uniform 
hypergraphs exactly as before. In particular, every n-vertex r-uniform hypergraph contains a 
full or a co-full subgraph with II(n/logn) vertices. 

• Digraphs. We could also ask about directed graphs. Since every subgraph of a transitive 
tournament has a vertex of in-degree zero and a vertex of out-degree zero, it is more fruitful 
to ask about extensions of Theorem [5] than of Theorem [TJ Let d'jj{v) denote the out-degree 
of a vertex v € H. A subgraph iL of a directed graph D is relatively q-out-fuH if for every 
u € if we have d'jj{v) > qd'^{v). Then the problem is to determine the smallest function 
h{n, q) such that every digraph on n vertices has a relatively g-out-full subgraph with at most 
h{n, q) vertices. 

• Weighted graphs. A weighted graph is a pair W = {V,w), where w : [0,1] is 

a weighting of pairs of vertices from V. The density p of IT is then the average pair-weight 
under re, and the degree dyix) of x in a subset T C T is the sum over all ?/ € T of w{{x, y}). 
Our definition of full subgraphs carries over to the weighted graph setting in the natural way, 
and we can ask: 

Problem 7. Let W be a weighted graph. Determine tight lower bounds for the order of a 
largest full subgraph of W. 

We may similarly ask about relatively g-full subgraphs for weighted graphs. The proof of 
Theorem [7] extends naturally to this setting: running through the same argument as before, 
we obtain a new equality 

M' = M + {l- q)d{Y) - dviy) + qd{x) - dx{x) - w{{x,y}). 

From the maximality of M = (1 — q)e{X) + qe(Y) over all weighted bipartitions, we deduce, 
in replacement of conditions (a)-(c), that 

qd{x) - dx{x) <w{{x,y}) and {I - q)d{y) - dyiv) < w{{x,y}). 

This implies as before that X U {y} is relatively g-full on [gn] -|- 1 vertices and Y U {a;} is 
relatively (1 —g)-full on [(1 —q)nj -|-1 vertices. Weighted analogues of Theorems [5] and [8] follow 
as immediate corollaries. However, this does not quite allow us to prove a weighted version of 
Theorem [TJ in the weighted setting, degrees need not be integers and so the minimum degree 
lower bound (|6|) becomes D > q, while the upper bound on the degree requirement for being 
a full subgraph in ([7]) remains q + 1 — p2“L Thus our argument in this case only yields a 
‘weakly’ full subgraph on N vertices (i.e. with minimum degree greater than p(N — 1) — 1, 
rather than p{N — 1)). Is this the best that can be done? We leave this as an open problem. 

We note that the proofs of Theorems [2] and [3| carry over to the weighted setting without any 
changes. To recover a weighted analogue of our result on full and co-full subgraphs. Theorem [TJ 
a weighted version of Thomason’s theorem (Proposition [TT|) would be needed. This problem 
too is left open. 

• Computational complexity. It appears likely that the following computational problems, 
which we have not investigated, are of similar complexity to Max Cut: 
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(i) find a largest full subgraph of G; 

(ii) given an integer k, determine whether or not G contains a full subgraph on k vertices; 

(iii) given an integer k, find a /c-vertex subgraph with largest minimum degree. 


The problem of finding (an approximation to) the densest subgraph of order k has received 
a significant amount of attention from the computer science community (see e.g. [1, [l^i. 
including in some variants involving degree constraints [^. Prom an algorithmic perspective, 
we thus expect that problems (i)-(iii) above will be hard: densest subgra ph of order k is 
known to be NP-Hard. Further, examples due to Schaffer and Yannakakis 20| and Monien 
and Tscheuschner [l^ for weighted versions of the Max-Cut problem suggest that local search 
(i.e. algorithms based on flipping vertices between a fc-set X whose minimum degree we are 
trying to maximise and its complement) could take exponential time to converge to a local 
optimum for problem (iii) (see also the work of Poljak 18|]). We note that the proofs of 
Theorems ECIE] and [8] yield polynomial time algorithms in each case. 
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